Model Selection

High-resolution human segmentation

# High-resolution human segmentation

Sapiens Seg 0.6b Bfloat16

Sapiens is a family of Vision Transformer models pre-trained on 300 million 1024x1024 resolution human images, focusing on human-centric vision tasks.

Image Segmentation English

Sapiens Seg 0.3b

Sapiens is a family of Vision Transformer models pre-trained on 300 million 1024×1024 resolution human images, focusing on human-centric vision tasks.

Image Segmentation English

Sapiens Seg 0.3b Torchscript

Sapiens is a family of vision Transformer models pre-trained on 300 million 1024 x 1024 resolution human images, supporting 1K high-resolution inference, demonstrating exceptional generalization to real-world data even with scarce or entirely synthetic labeled data.

Image Segmentation English

Sapiens Seg 1b Torchscript

Sapiens is a series of vision transformers pre-trained on 300 million 1024×1024 resolution human images, specifically designed for human-centric vision tasks with exceptional generalization capabilities.

Image Segmentation English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase